Indexing Cost Sensitive Prediction
نویسندگان
چکیده
Predictive models are often used for real-time decision making. However, typical machine learning techniques ignore feature evaluation cost, and focus solely on the accuracy of the machine learning models obtained utilizing all the features available. We develop algorithms and indexes to support cost-sensitive prediction, i.e., making decisions using machine learning models taking feature evaluation cost into account. Given an item and a online computation cost (i.e., time) budget, we present two approaches to return an appropriately chosen machine learning model that will run within the specified time on the given item. The first approach returns the optimal machine learning model, i.e., one with the highest accuracy, that runs within the specified time, but requires significant up-front precomputation time. The second approach returns a possibly suboptimal machine learning model, but requires little up-front precomputation time. We study these two algorithms in detail and characterize the scenarios (using real and synthetic data) in which each performs well. Unlike prior work that focuses on a narrow domain or a specific algorithm, our techniques are very general: they apply to any cost-sensitive prediction scenario on any machine learning algorithm.
منابع مشابه
Indexing Cost Sensitive Prediction
Predictive models are often used for real-time decision making. However, typical machine learning techniques ignore feature evaluation cost, and focus solely on the accuracy of the machine learning models obtained utilizing all the features available. We develop algorithms and indexes to support cost-sensitive prediction, i.e., making decisions using machine learning models taking feature evalu...
متن کاملData Mining Based Predictive Models for Overall Health Indices
In this study, we infer health care indices of individuals using their pharmacy medical and prescription claims. Specifically, we focus on the widely used Charlson Index. We use data mining techniques to formulate the problem of classifying Charlson Index (CI) and build predictive models to predict individual health index score. First, we present comparative analyses of several classification a...
متن کاملIndexing for Vertical Search Engine: Cost Sensitive
The information on the WWW is growing exponentially and the dynamic, unstructured data & structured data needs to locate as useful resources, web pages and online database in enormous quantity. In this paper we propose the novel indexing technique to download the hidden web pages which is based on domain specific. This technique keeps the related documents in the same domain so that searching o...
متن کاملA New Sensitive Method for Detection of Viroids
Background and Aims: Viroids are smallest known plant pathogens and cause several economically significant diseases. Until recently, viroid detection relied mainly on biological tests and indexing. Today various diagnostic techniques such as nucleic acid hybridization, southern blot and reverse transcription coupled with polymerase chain reaction (RT-PCR) are being used for detection and diag...
متن کاملTo Customize or Not to Customize? The Use of a Customization Tool to Augment Information Indexing in a Computer Desktop Environment
We studied when and how people will use a customization tool that helps users offload information indexing to the external environment to augment finding and re-finding of information in a computer desktop environment. An experiment was conducted to study how the cost and benefit of customization may influence when and how participants customize, and how the customization may help them find and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1408.4072 شماره
صفحات -
تاریخ انتشار 2014